Overview
Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 775 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 92.3 KiB |
| Average record size in memory | 122.0 B |
Variable types
| Categorical | 7 |
|---|---|
| Numeric | 6 |
| Boolean | 3 |
adult_male is highly overall correlated with alive and 3 other fields | High correlation |
age is highly overall correlated with age_standardized and 1 other fields | High correlation |
age_standardized is highly overall correlated with age and 1 other fields | High correlation |
alive is highly overall correlated with adult_male and 3 other fields | High correlation |
alone is highly overall correlated with parch and 1 other fields | High correlation |
class is highly overall correlated with pclass | High correlation |
embark_town is highly overall correlated with embarked | High correlation |
embarked is highly overall correlated with embark_town | High correlation |
fare is highly overall correlated with fare_normalized | High correlation |
fare_normalized is highly overall correlated with fare | High correlation |
parch is highly overall correlated with alone | High correlation |
pclass is highly overall correlated with class | High correlation |
sex is highly overall correlated with adult_male and 3 other fields | High correlation |
sibsp is highly overall correlated with alone | High correlation |
survived is highly overall correlated with adult_male and 3 other fields | High correlation |
who is highly overall correlated with adult_male and 5 other fields | High correlation |
sibsp has 508 (65.5%) zeros | Zeros |
parch has 571 (73.7%) zeros | Zeros |
fare has 9 (1.2%) zeros | Zeros |
fare_normalized has 9 (1.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-14 01:34:11.304089 |
|---|---|
| Analysis finished | 2025-12-14 01:34:12.826792 |
| Duration | 1.52 second |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
survived
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 455 | |
| 1 | 320 |
pclass
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| 3 | |
|---|---|
| 1 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 401 | |
| 1 | 210 | |
| 2 | 164 |
sex
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7535484 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 483 | |
| female | 292 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 483 | |
| female | 292 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1067 | |
| m | 775 | |
| a | 775 | |
| l | 775 | |
| f | 292 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3684 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1067 | |
| m | 775 | |
| a | 775 | |
| l | 775 | |
| f | 292 | 7.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3684 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1067 | |
| m | 775 | |
| a | 775 | |
| l | 775 | |
| f | 292 | 7.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3684 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1067 | |
| m | 775 | |
| a | 775 | |
| l | 775 | |
| f | 292 | 7.9% |
age
Real number (ℝ)
High correlation
| Distinct | 88 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.581187 |
| Minimum | 0.42 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0.42 |
|---|---|
| 5-th percentile | 4.7 |
| Q1 | 21 |
| median | 28 |
| Q3 | 36 |
| 95-th percentile | 55.15 |
| Maximum | 80 |
| Range | 79.58 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 13.766359 |
|---|---|
| Coefficient of variation (CV) | 0.46537546 |
| Kurtosis | 0.56732321 |
| Mean | 29.581187 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.44198678 |
| Sum | 22925.42 |
| Variance | 189.51263 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28 | 121 | 15.6% |
| 24 | 29 | 3.7% |
| 18 | 25 | 3.2% |
| 22 | 24 | 3.1% |
| 19 | 23 | 3.0% |
| 21 | 22 | 2.8% |
| 30 | 22 | 2.8% |
| 36 | 21 | 2.7% |
| 25 | 20 | 2.6% |
| 29 | 19 | 2.5% |
| Other values (78) | 449 |
| Value | Count | Frequency (%) |
| 0.42 | 1 | 0.1% |
| 0.67 | 1 | 0.1% |
| 0.75 | 1 | 0.1% |
| 0.83 | 2 | 0.3% |
| 0.92 | 1 | 0.1% |
| 1 | 7 | |
| 2 | 10 | |
| 3 | 6 | |
| 4 | 10 | |
| 5 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 80 | 1 | 0.1% |
| 74 | 1 | 0.1% |
| 71 | 2 | |
| 70.5 | 1 | 0.1% |
| 70 | 2 | |
| 66 | 1 | 0.1% |
| 65 | 3 | |
| 64 | 2 | |
| 63 | 2 | |
| 62 | 3 |
sibsp
Real number (ℝ)
High correlation Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.52903226 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 508 |
| Zeros (%) | 65.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2.3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9903258 |
|---|---|
| Coefficient of variation (CV) | 1.8719573 |
| Kurtosis | 12.608666 |
| Mean | 0.52903226 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.0360781 |
| Sum | 410 |
| Variance | 0.98074519 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 508 | |
| 1 | 201 | 25.9% |
| 2 | 27 | 3.5% |
| 4 | 18 | 2.3% |
| 3 | 14 | 1.8% |
| 5 | 5 | 0.6% |
| 8 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 508 | |
| 1 | 201 | 25.9% |
| 2 | 27 | 3.5% |
| 3 | 14 | 1.8% |
| 4 | 18 | 2.3% |
| 5 | 5 | 0.6% |
| 8 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 8 | 2 | 0.3% |
| 5 | 5 | 0.6% |
| 4 | 18 | 2.3% |
| 3 | 14 | 1.8% |
| 2 | 27 | 3.5% |
| 1 | 201 | 25.9% |
| 0 | 508 |
parch
Real number (ℝ)
High correlation Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.42064516 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 571 |
| Zeros (%) | 73.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.84056527 |
|---|---|
| Coefficient of variation (CV) | 1.9982763 |
| Kurtosis | 8.8375634 |
| Mean | 0.42064516 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.6133475 |
| Sum | 326 |
| Variance | 0.70654997 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 114 | 14.7% |
| 2 | 75 | 9.7% |
| 5 | 5 | 0.6% |
| 3 | 5 | 0.6% |
| 4 | 4 | 0.5% |
| 6 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 571 | |
| 1 | 114 | 14.7% |
| 2 | 75 | 9.7% |
| 3 | 5 | 0.6% |
| 4 | 4 | 0.5% |
| 5 | 5 | 0.6% |
| 6 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | 0.1% |
| 5 | 5 | 0.6% |
| 4 | 4 | 0.5% |
| 3 | 5 | 0.6% |
| 2 | 75 | 9.7% |
| 1 | 114 | 14.7% |
| 0 | 571 |
fare
Real number (ℝ)
High correlation Zeros
| Distinct | 248 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.878403 |
| Minimum | 0 |
|---|---|
| Maximum | 512.3292 |
| Zeros | 9 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.225 |
| Q1 | 8.05 |
| median | 15.9 |
| Q3 | 34.1979 |
| 95-th percentile | 120 |
| Maximum | 512.3292 |
| Range | 512.3292 |
| Interquartile range (IQR) | 26.1479 |
Descriptive statistics
| Standard deviation | 52.408474 |
|---|---|
| Coefficient of variation (CV) | 1.5026053 |
| Kurtosis | 29.905898 |
| Mean | 34.878403 |
| Median Absolute Deviation (MAD) | 8.3792 |
| Skewness | 4.5499504 |
| Sum | 27030.762 |
| Variance | 2746.6481 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 31 | 4.0% |
| 26 | 30 | 3.9% |
| 8.05 | 25 | 3.2% |
| 10.5 | 23 | 3.0% |
| 7.75 | 20 | 2.6% |
| 7.8958 | 19 | 2.5% |
| 7.925 | 16 | 2.1% |
| 7.775 | 16 | 2.1% |
| 26.55 | 13 | 1.7% |
| 7.8542 | 12 | 1.5% |
| Other values (238) | 570 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 4.0125 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 6.2375 | 1 | 0.1% |
| 6.4375 | 1 | 0.1% |
| 6.45 | 1 | 0.1% |
| 6.4958 | 2 | 0.3% |
| 6.75 | 2 | 0.3% |
| 6.8583 | 1 | 0.1% |
| 6.95 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 512.3292 | 3 | |
| 263 | 4 | |
| 262.375 | 2 | |
| 247.5208 | 2 | |
| 227.525 | 4 | |
| 221.7792 | 1 | 0.1% |
| 211.5 | 1 | 0.1% |
| 211.3375 | 3 | |
| 164.8667 | 2 | |
| 153.4625 | 3 |
embarked
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| S | |
|---|---|
| C | |
| Q |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | C |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 562 | |
| C | 155 | 20.0% |
| Q | 58 | 7.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 562 | |
| c | 155 | 20.0% |
| q | 58 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 562 | |
| C | 155 | 20.0% |
| Q | 58 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 562 | |
| C | 155 | 20.0% |
| Q | 58 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 562 | |
| C | 155 | 20.0% |
| Q | 58 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 775 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 562 | |
| C | 155 | 20.0% |
| Q | 58 | 7.5% |
class
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| Third | |
|---|---|
| First | |
| Second |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.2116129 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Third |
|---|---|
| 2nd row | First |
| 3rd row | Third |
| 4th row | First |
| 5th row | Third |
Common Values
| Value | Count | Frequency (%) |
| Third | 401 | |
| First | 210 | |
| Second | 164 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| third | 401 | |
| first | 210 | |
| second | 164 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 611 | |
| r | 611 | |
| d | 565 | |
| T | 401 | |
| h | 401 | |
| F | 210 | 5.2% |
| s | 210 | 5.2% |
| t | 210 | 5.2% |
| S | 164 | 4.1% |
| e | 164 | 4.1% |
| Other values (3) | 492 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4039 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 611 | |
| r | 611 | |
| d | 565 | |
| T | 401 | |
| h | 401 | |
| F | 210 | 5.2% |
| s | 210 | 5.2% |
| t | 210 | 5.2% |
| S | 164 | 4.1% |
| e | 164 | 4.1% |
| Other values (3) | 492 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4039 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 611 | |
| r | 611 | |
| d | 565 | |
| T | 401 | |
| h | 401 | |
| F | 210 | 5.2% |
| s | 210 | 5.2% |
| t | 210 | 5.2% |
| S | 164 | 4.1% |
| e | 164 | 4.1% |
| Other values (3) | 492 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4039 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 611 | |
| r | 611 | |
| d | 565 | |
| T | 401 | |
| h | 401 | |
| F | 210 | 5.2% |
| s | 210 | 5.2% |
| t | 210 | 5.2% |
| S | 164 | 4.1% |
| e | 164 | 4.1% |
| Other values (3) | 492 |
who
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| man | |
|---|---|
| woman | |
| child |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.8567742 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | man |
|---|---|
| 2nd row | woman |
| 3rd row | woman |
| 4th row | woman |
| 5th row | man |
Common Values
| Value | Count | Frequency (%) |
| man | 443 | |
| woman | 250 | |
| child | 82 | 10.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| man | 443 | |
| woman | 250 | |
| child | 82 | 10.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 693 | |
| a | 693 | |
| n | 693 | |
| w | 250 | 8.4% |
| o | 250 | 8.4% |
| c | 82 | 2.7% |
| h | 82 | 2.7% |
| i | 82 | 2.7% |
| l | 82 | 2.7% |
| d | 82 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2989 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| m | 693 | |
| a | 693 | |
| n | 693 | |
| w | 250 | 8.4% |
| o | 250 | 8.4% |
| c | 82 | 2.7% |
| h | 82 | 2.7% |
| i | 82 | 2.7% |
| l | 82 | 2.7% |
| d | 82 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2989 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| m | 693 | |
| a | 693 | |
| n | 693 | |
| w | 250 | 8.4% |
| o | 250 | 8.4% |
| c | 82 | 2.7% |
| h | 82 | 2.7% |
| i | 82 | 2.7% |
| l | 82 | 2.7% |
| d | 82 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2989 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| m | 693 | |
| a | 693 | |
| n | 693 | |
| w | 250 | 8.4% |
| o | 250 | 8.4% |
| c | 82 | 2.7% |
| h | 82 | 2.7% |
| i | 82 | 2.7% |
| l | 82 | 2.7% |
| d | 82 | 2.7% |
adult_male
Boolean
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 443 | |
| False | 332 |
embark_town
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.1 KiB |
| Southampton | |
|---|---|
| Cherbourg | |
| Queenstown |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.525161 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Southampton |
|---|---|
| 2nd row | Cherbourg |
| 3rd row | Southampton |
| 4th row | Southampton |
| 5th row | Southampton |
Common Values
| Value | Count | Frequency (%) |
| Southampton | 562 | |
| Cherbourg | 155 | 20.0% |
| Queenstown | 58 | 7.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| southampton | 562 | |
| cherbourg | 155 | 20.0% |
| queenstown | 58 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1337 | |
| t | 1182 | |
| u | 775 | |
| h | 717 | |
| n | 678 | |
| p | 562 | |
| S | 562 | |
| m | 562 | |
| a | 562 | |
| r | 310 | 3.8% |
| Other values (7) | 910 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8157 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1337 | |
| t | 1182 | |
| u | 775 | |
| h | 717 | |
| n | 678 | |
| p | 562 | |
| S | 562 | |
| m | 562 | |
| a | 562 | |
| r | 310 | 3.8% |
| Other values (7) | 910 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8157 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1337 | |
| t | 1182 | |
| u | 775 | |
| h | 717 | |
| n | 678 | |
| p | 562 | |
| S | 562 | |
| m | 562 | |
| a | 562 | |
| r | 310 | 3.8% |
| Other values (7) | 910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8157 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1337 | |
| t | 1182 | |
| u | 775 | |
| h | 717 | |
| n | 678 | |
| p | 562 | |
| S | 562 | |
| m | 562 | |
| a | 562 | |
| r | 310 | 3.8% |
| Other values (7) | 910 |
alive
Boolean
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 455 | |
| True | 320 |
alone
Boolean
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.8 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 437 | |
| False | 338 |
fare_normalized
Real number (ℝ)
High correlation Zeros
| Distinct | 248 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.068078109 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 9 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.014102261 |
| Q1 | 0.015712554 |
| median | 0.031034733 |
| Q3 | 0.066749855 |
| 95-th percentile | 0.2342244 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.051037302 |
Descriptive statistics
| Standard deviation | 0.10229453 |
|---|---|
| Coefficient of variation (CV) | 1.5026053 |
| Kurtosis | 29.905898 |
| Mean | 0.068078109 |
| Median Absolute Deviation (MAD) | 0.016355109 |
| Skewness | 4.5499504 |
| Sum | 52.760534 |
| Variance | 0.01046417 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.02537431011 | 31 | 4.0% |
| 0.05074862022 | 30 | 3.9% |
| 0.01571255357 | 25 | 3.2% |
| 0.02049463509 | 23 | 3.0% |
| 0.01512699257 | 20 | 2.6% |
| 0.01541157521 | 19 | 2.5% |
| 0.01546856982 | 16 | 2.1% |
| 0.01517578932 | 16 | 2.1% |
| 0.05182214873 | 13 | 1.7% |
| 0.01533037742 | 12 | 1.5% |
| Other values (238) | 570 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 0.007831878409 | 1 | 0.1% |
| 0.009759350043 | 1 | 0.1% |
| 0.01217478918 | 1 | 0.1% |
| 0.01256516318 | 1 | 0.1% |
| 0.01258956156 | 1 | 0.1% |
| 0.0126789572 | 2 | 0.3% |
| 0.01317512256 | 2 | 0.3% |
| 0.01338651008 | 1 | 0.1% |
| 0.01356549656 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 0.5133418123 | 4 | |
| 0.5121218935 | 2 | |
| 0.483128426 | 2 | |
| 0.4440992237 | 4 | |
| 0.432884169 | 1 | 0.1% |
| 0.4128205068 | 1 | 0.1% |
| 0.4125033279 | 3 | |
| 0.3217983671 | 2 | |
| 0.2995388512 | 3 |
age_standardized
Real number (ℝ)
High correlation
| Distinct | 88 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3837563 × 10-16 |
| Minimum | -2.1196614 |
|---|---|
| Maximum | 3.6648306 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 459 |
| Negative (%) | 59.2% |
| Memory size | 12.1 KiB |
Quantile statistics
| Minimum | -2.1196614 |
|---|---|
| 5-th percentile | -1.8085578 |
| Q1 | -0.62374728 |
| median | -0.11493295 |
| Q3 | 0.46656914 |
| 95-th percentile | 1.8585398 |
| Maximum | 3.6648306 |
| Range | 5.7844921 |
| Interquartile range (IQR) | 1.0903164 |
Descriptive statistics
| Standard deviation | 1.0006458 |
|---|---|
| Coefficient of variation (CV) | 4.1977689 × 1015 |
| Kurtosis | 0.56732321 |
| Mean | 2.3837563 × 10-16 |
| Median Absolute Deviation (MAD) | 0.50881433 |
| Skewness | 0.44198678 |
| Sum | 1.9184654 × 10-13 |
| Variance | 1.001292 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.1149329506 | 121 | 15.6% |
| -0.4056839965 | 29 | 3.7% |
| -0.8418105654 | 25 | 3.2% |
| -0.5510595195 | 24 | 3.1% |
| -0.7691228039 | 23 | 3.0% |
| -0.623747281 | 22 | 2.8% |
| 0.03044257242 | 22 | 2.8% |
| 0.4665691413 | 21 | 2.7% |
| -0.332996235 | 20 | 2.6% |
| -0.04224518907 | 19 | 2.5% |
| Other values (78) | 449 |
| Value | Count | Frequency (%) |
| -2.119661412 | 1 | 0.1% |
| -2.101489472 | 1 | 0.1% |
| -2.095674451 | 1 | 0.1% |
| -2.08985943 | 2 | 0.3% |
| -2.083317532 | 1 | 0.1% |
| -2.077502511 | 7 | |
| -2.004814749 | 10 | |
| -1.932126988 | 6 | |
| -1.859439226 | 10 | |
| -1.786751465 | 4 | 0.5% |
| Value | Count | Frequency (%) |
| 3.664830647 | 1 | 0.1% |
| 3.228704078 | 1 | 0.1% |
| 3.010640793 | 2 | |
| 2.974296913 | 1 | 0.1% |
| 2.937953032 | 2 | |
| 2.647201986 | 1 | 0.1% |
| 2.574514224 | 3 | |
| 2.501826463 | 2 | |
| 2.429138702 | 2 | |
| 2.35645094 | 3 |
Interactions
Correlations
| adult_male | age | age_standardized | alive | alone | class | embark_town | embarked | fare | fare_normalized | parch | pclass | sex | sibsp | survived | who | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| adult_male | 1.000 | 0.365 | 0.365 | 0.526 | 0.381 | 0.076 | 0.071 | 0.071 | 0.149 | 0.149 | 0.388 | 0.076 | 0.895 | 0.299 | 0.526 | 0.999 |
| age | 0.365 | 1.000 | 1.000 | 0.137 | 0.330 | 0.250 | 0.137 | 0.137 | 0.126 | 0.126 | -0.237 | 0.250 | 0.083 | -0.169 | 0.137 | 0.650 |
| age_standardized | 0.365 | 1.000 | 1.000 | 0.137 | 0.330 | 0.250 | 0.137 | 0.137 | 0.126 | 0.126 | -0.237 | 0.250 | 0.083 | -0.169 | 0.137 | 0.650 |
| alive | 0.526 | 0.137 | 0.137 | 1.000 | 0.170 | 0.331 | 0.163 | 0.163 | 0.284 | 0.284 | 0.146 | 0.331 | 0.512 | 0.154 | 0.997 | 0.536 |
| alone | 0.381 | 0.330 | 0.330 | 0.170 | 1.000 | 0.105 | 0.100 | 0.100 | 0.287 | 0.287 | 0.674 | 0.105 | 0.272 | 0.820 | 0.170 | 0.433 |
| class | 0.076 | 0.250 | 0.250 | 0.331 | 0.105 | 1.000 | 0.252 | 0.252 | 0.496 | 0.496 | 0.029 | 1.000 | 0.118 | 0.142 | 0.331 | 0.144 |
| embark_town | 0.071 | 0.137 | 0.137 | 0.163 | 0.100 | 0.252 | 1.000 | 1.000 | 0.198 | 0.198 | 0.011 | 0.252 | 0.086 | 0.092 | 0.163 | 0.049 |
| embarked | 0.071 | 0.137 | 0.137 | 0.163 | 0.100 | 0.252 | 1.000 | 1.000 | 0.198 | 0.198 | 0.011 | 0.252 | 0.086 | 0.092 | 0.163 | 0.049 |
| fare | 0.149 | 0.126 | 0.126 | 0.284 | 0.287 | 0.496 | 0.198 | 0.198 | 1.000 | 1.000 | 0.380 | 0.496 | 0.181 | 0.412 | 0.284 | 0.158 |
| fare_normalized | 0.149 | 0.126 | 0.126 | 0.284 | 0.287 | 0.496 | 0.198 | 0.198 | 1.000 | 1.000 | 0.380 | 0.496 | 0.181 | 0.412 | 0.284 | 0.158 |
| parch | 0.388 | -0.237 | -0.237 | 0.146 | 0.674 | 0.029 | 0.011 | 0.011 | 0.380 | 0.380 | 1.000 | 0.029 | 0.232 | 0.413 | 0.146 | 0.385 |
| pclass | 0.076 | 0.250 | 0.250 | 0.331 | 0.105 | 1.000 | 0.252 | 0.252 | 0.496 | 0.496 | 0.029 | 1.000 | 0.118 | 0.142 | 0.331 | 0.144 |
| sex | 0.895 | 0.083 | 0.083 | 0.512 | 0.272 | 0.118 | 0.086 | 0.086 | 0.181 | 0.181 | 0.232 | 0.118 | 1.000 | 0.169 | 0.512 | 0.941 |
| sibsp | 0.299 | -0.169 | -0.169 | 0.154 | 0.820 | 0.142 | 0.092 | 0.092 | 0.412 | 0.412 | 0.413 | 0.142 | 0.169 | 1.000 | 0.154 | 0.364 |
| survived | 0.526 | 0.137 | 0.137 | 0.997 | 0.170 | 0.331 | 0.163 | 0.163 | 0.284 | 0.284 | 0.146 | 0.331 | 0.512 | 0.154 | 1.000 | 0.536 |
| who | 0.999 | 0.650 | 0.650 | 0.536 | 0.433 | 0.144 | 0.049 | 0.049 | 0.158 | 0.158 | 0.385 | 0.144 | 0.941 | 0.364 | 0.536 | 1.000 |
Missing values
Sample
| survived | pclass | sex | age | sibsp | parch | fare | embarked | class | who | adult_male | embark_town | alive | alone | fare_normalized | age_standardized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 3 | male | 22.0 | 1 | 0 | 7.2500 | S | Third | man | True | Southampton | no | False | 0.014151 | -0.551060 |
| 1 | 1 | 1 | female | 38.0 | 1 | 0 | 71.2833 | C | First | woman | False | Cherbourg | yes | False | 0.139136 | 0.611945 |
| 2 | 1 | 3 | female | 26.0 | 0 | 0 | 7.9250 | S | Third | woman | False | Southampton | yes | True | 0.015469 | -0.260308 |
| 3 | 1 | 1 | female | 35.0 | 1 | 0 | 53.1000 | S | First | woman | False | Southampton | yes | False | 0.103644 | 0.393881 |
| 4 | 0 | 3 | male | 35.0 | 0 | 0 | 8.0500 | S | Third | man | True | Southampton | no | True | 0.015713 | 0.393881 |
| 5 | 0 | 3 | male | 28.0 | 0 | 0 | 8.4583 | Q | Third | man | True | Queenstown | no | True | 0.016510 | -0.114933 |
| 6 | 0 | 1 | male | 54.0 | 0 | 0 | 51.8625 | S | First | man | True | Southampton | no | True | 0.101229 | 1.774949 |
| 7 | 0 | 3 | male | 2.0 | 3 | 1 | 21.0750 | S | Third | child | False | Southampton | no | False | 0.041136 | -2.004815 |
| 8 | 1 | 3 | female | 27.0 | 0 | 2 | 11.1333 | S | Third | woman | False | Southampton | yes | False | 0.021731 | -0.187621 |
| 9 | 1 | 2 | female | 14.0 | 1 | 0 | 30.0708 | C | Second | child | False | Cherbourg | yes | False | 0.058694 | -1.132562 |
| survived | pclass | sex | age | sibsp | parch | fare | embarked | class | who | adult_male | embark_town | alive | alone | fare_normalized | age_standardized | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 879 | 1 | 1 | female | 56.0 | 0 | 1 | 83.1583 | C | First | woman | False | Cherbourg | yes | False | 0.162314 | 1.920324 |
| 880 | 1 | 2 | female | 25.0 | 0 | 1 | 26.0000 | S | Second | woman | False | Southampton | yes | False | 0.050749 | -0.332996 |
| 881 | 0 | 3 | male | 33.0 | 0 | 0 | 7.8958 | S | Third | man | True | Southampton | no | True | 0.015412 | 0.248506 |
| 882 | 0 | 3 | female | 22.0 | 0 | 0 | 10.5167 | S | Third | woman | False | Southampton | no | True | 0.020527 | -0.551060 |
| 883 | 0 | 2 | male | 28.0 | 0 | 0 | 10.5000 | S | Second | man | True | Southampton | no | True | 0.020495 | -0.114933 |
| 885 | 0 | 3 | female | 39.0 | 0 | 5 | 29.1250 | Q | Third | woman | False | Queenstown | no | False | 0.056848 | 0.684632 |
| 887 | 1 | 1 | female | 19.0 | 0 | 0 | 30.0000 | S | First | woman | False | Southampton | yes | True | 0.058556 | -0.769123 |
| 888 | 0 | 3 | female | 28.0 | 1 | 2 | 23.4500 | S | Third | woman | False | Southampton | no | False | 0.045771 | -0.114933 |
| 889 | 1 | 1 | male | 26.0 | 0 | 0 | 30.0000 | C | First | man | True | Cherbourg | yes | True | 0.058556 | -0.260308 |
| 890 | 0 | 3 | male | 32.0 | 0 | 0 | 7.7500 | Q | Third | man | True | Queenstown | no | True | 0.015127 | 0.175818 |